General Disclaimer 


One or more of the Following Statements may affect this Document 


• This document has been reproduced from the best copy furnished by the 
organizational source. It is being released in the interest of making available as 
much information as possible. 


• This document may contain data, which exceeds the sheet parameters. It was 
furnished in this condition by the organizational source and is the best copy 
available. 


• This document may contain tone-on-tone or color graphs, charts and/or pictures, 
which have been reproduced in black and white. 


• This document is paginated as submitted by the original source. 


• Portions of this document are not fully legible due to the historical nature of some 
of the material. However, it is the best reproduction available from the original 
submission. 


Produced by the NASA Center for Aerospace Information (CASI) 




Mic 3 X>prorosr.or Utilization in Search & R.-r>cui ‘'Tissions 

FRiAI. W-:P 0 RT 


Paul E. SchmU 
Ccxie 932 
NASA - GSFC 


Introilu ction ; 

lx>.sition of an c3nt?r<^oncy tr.inrjnittcr may lx; deatominod by measuring 
the shift of tho distress signal as received by an orbiting Mtellite. 

niis rcx^uires tbe cxirixitation of on initial estirato .and refinement of this 
estiirate tJarough .an iterative, nonlinear, least-stiuaros estimation. 

A version of the above algoritlun was iinjlcmenti.^l at Toddard Si>ace Flight 
Center (GSFC) .and tested by local ijig a l-ran:anitti.'r on the premises .and obtaining 
observations from a satellite. 'Hxa cx>nputer used was an IE.M 360/95. Hae ix>- 
sition was delermiinad within the desired 10 km r.adius accur.acy. 

■nie purpose of tJiis project is to dctoiirine the feasibility of per forming 
tlx? same task in real time using micrcprocessor techixDlogy. The least square 
.algc*rithim was iipjle.'n.'ntixl on an Intel 8080 mierToprooessor and the sane experi- 


irent was run as at (IStXT. 

Tlie ri:!sults iixlieate tliat a micrxpjoxessor c.m e.asi ly match the 1E.M im- j 


pl«ronbrition in .accuracy .ind Ixr [viTorinM inside tho the limibiticns set. 
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Microprocessors : 

Time is nn inplicit restriction in any iK?arch and rescue mission. H>e 
use of satellites and computers is dictated by tJiat time limit. Ibe use of 
a big oonputer to determine the positim prosipposcs cormunication between the 
;^tellite and the ccxrputer. This ooninunication introduces a time delay since 
tin satellite is not always witliin radio visibility of an installation that 
ixxs.sesses botli the oonmunication and coqxating power for this problem. Fur- 
th»?niore the result has to be forwar»led to a aDiimand ccnU»r to do the dispatching. 

Micrcprocossor utilixation can alleviate this situation in two ways: 
by giving cheap car^xiting povAar to cixmanic.it ion facilities or by incorporating 
the eexputing power in the satellite itself thus eliminating this ooiTinunication 
otairialotely. 


Microprrxaessors offer light wight, small volume, low power processing. 
Tlioir speed is im^aroving rapidly and their cost is going df)wn. They -are the 
lojical ciKjice for a satellite search ajxl rescue system if Ux.y c^m pcrfoim. 



Michino Ccrj figuration: 
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strictly speaking there arc three microprocessor configurations in 
this project which we are going to discuss individually. 

• Development system 

• Minimal execution system 

• /V-'tual field configuration 

Initially our developnont system consisted of an ros-80 Intellec micro- 
ccxiputer by Intel with 16k bytes of HAM inerory tind a resident RDM monitor. 

Most of the floating foint p.icka(jc .as dowlopixi in rtichine langu'ge on that 
system using the monitor's limited hexadocijral editor and debugger. The need 
for more sophistication became apparent. After several failures in exploring 
alternatives (as fancy as hooking to a PDP 11 through a teleplwne line 
for more storage) we v^re able to acquire a dual floppy disk drive by Intel. 

A spare line printer was attached to tlie system with minor hardware modifica- 
tions and 16k bytes more RAM were added in order to support DOS. The enhanced 
system had the powar of a mini-conputer in software (assembler, editor, li- 
brary nunager, linJeage editor, leader, and a sufficient file manager) at a 
speed which was slow but acceptable. The floating point pac)cage was converted 
to assembly language, and two more paclcages w're developed: the I/O pac)cage 

and the matr'x nunipulation package. Unexixx;tod help came from the use of 
ICE-80 (In Circuit Bmilator) , designed for a different application, as a 
poi>«?rful symbolic debugger substituting for the monitor hexadeciiral debugger. 

Out of this final version of Uie dovelotTrent system only a limited 
amount of resources were used for the final nin. Those define the mininal 
exfxnjtion systrm. The disk was - -nly used for input <?f data. The essential 
[Tcirts were: 

• 'H-ie a>U c.ird 


• 16k Bytes of niiroiy 


• 'Hkj I'onr.olc liovico and it::5 intorface 

• Pcwor supply : 12V, 5V, -5V, fj’-ovuid 

Additionally, tho lino printor was usod to pnxlucc a hardcopy version of 
the results. 

He actual field confiijuratioa would bo Uio sane if tlx 2 nvichino v;cre 
locatfxi on Uv 2 yround. Sone kitid of conniuni cations cquipront vould bo ro- 
quired to provide tlio data inixit a.nd, nviybe, start tlie run autmutically. 

H>e configuration would be different, though, if the machine v^-re located on 
tho satellite. Tire rixiuircnonts ^or tlx? Svitollite con figui at ion wxdd be: 

• 'Iho tTU cai-d 

• 16k bytes of nvjiory 

An interface tfurt can load tlx? infoniotion in nojiory 

• A means to coninunicate the result to the world 


sui'ply: 12V, 5V, -5V, grouird 


Floati ng Poin t Pac kngo ; 

Biinod on estimates of the numlx^r of o[X'rati(ans rrquinxi \vc wore in- 
clined to thiiik that any floating point oix'rati.ons vould have to be porfonnt?d 
lay h.\rdware ojk! not by software since estimitod times became proliibitivo. 

Iliis floating point package was developed to lielp us count the actual number 
of oixarations rather than peiform them in an actual situation. Tlae final run 
proved our estimates wrong and the package gained new ijnx>rtance. 

nere cure a number of representations of floating poing numbers dif- 
fering in accuracy and range as a trade off to tlie nuinbi'‘r of bytes rix^uired 
ixu* numbL>r. Itie one used \*is U>e ANSI fontrrt for tXDi^TRAN whiirh ha{.^xns to 
bo ijiirlcnented by hardware as an option in TliM (?oii^>uti^rs . It consists of one 
sign bit, a seven bit exixnont (excess 64), and a 24 bit mantissa of hexa- 
tlocimal digits. 'Hie accuracy is 6 hexadecimal Ugits or approxinately 7.2 
dixriiTul digits. Sixxrific operations were not timed altlough a noro general 
timiixj an.ilysis apixxirs in a later :-.ix:lion. 'Ihis fonnit was clioscn as oj^ixid 
to Uie bCD foniot because the space reguiix'jionts arc lover for the same 
.unount of piocision, which in turn n'dixrcs execution time sliglitly. A man- 
tissa of biruiry digits was not used txxrauco of the fivtjuent need for noniali- 
zation. 

Afldition and subtraction take exactly the s,mie time, wh'rcas multipli- 
cation is appioximrti^ly e^jual to 22 addition I’ld division is approximately 
60 a(idit-ions. 

Mult.ipl i cation produces a 48 bit result jimtissa which is then nonivalizod 
and laxindud to 24 bits. Tlris preserved Uie nuni^ar Ot sitjniric.mt tiigits, or, 
viixaxi from a different tangle, is tlio same as a dixilrlo pt*-visic\n multiply if 
tlx? two .irgnu'nts rx're exi^andixl with /eio fill. 

l')ivision pieiera/cs the significant digits again ly cxixinding the nv‘«n- 
tis;wi of tire dividc'nd to double precision .urd ia?:-nlts in full single prex'i- 
s.ion result. Nixmil i’/nt ion and roiuuling txxrur as in mult iid i cat ion. 


TWrauMcy is tlius proj'A'iwxl t o t nio siivjlo pivcision Ihroivjliuut in a 
nuu'i ically stable nuiuK'‘r tlie loivjih of the nunix'r to 4 bytes. 'I1v3 

cost is c.>q3i'nsive nultiplication and, cxixviallu, division, lliis dicL^U'S 
a pnxjranining style wlvivly division is awid'd ludess it is absolutely mx>?s- 
sary. Tte benefits, on the other hand, are numerically stable inplomenta- 
lions wlwsc lesults notch Uk* tlouble pxccision to tlxi extent iTOssible as 
will Ixj ;xvn wlK'n Uxo results of Uio nm aic .uialy/ixl. 

'Hxe s*-]u.ire nx^t function w^xs invlemL'nlixl l>y using a variation of Heixin's 
roniiula lv\!kxl ixi tlx* obiJciViit ii'ix tliat. tlx' mud iiso of .my lli\»tin<] loint mun- 
K'r will have a valix* of 1/16 to 1 (int»'ipn'i«\l a:; a fiactien). As a fii>;t 
guess an approxinvition to a straight line ixxuux't ing tlxo tv%o end ix3ints is 
mule. Exix'rincntally, six iterations vsciv fovuxl ntxx*:-.rHuy to pjvxluco ixn 
a(.x'urato ivsult. A better first guess txxuld invrove tJat si(jnificontly, but 
t ini'? a'»nstraints did not allow us to pursue tlat diixx'tion. 

Finally, injiut and output of flailing ix>int mmilx*rs t\nn out a niK'h 
iii>ie serious task tJvin first ex^xx-dixi. 'Ibe ininil mntine uxx>jni/Axl mmix'rs 
with a maximum of ten intixjor and ten fract ion tligifs. 'Uns pieviHl muu 
than sufficii'nt for our noixls. 'Hh? oulpiiL ixnitiiu* pnxle.ccs .i rigid s.i’icnti- 
fic fotmit with 10 fraction digits. Mvn intci^nct iiwj tlxi lesults it slxmld 
K* kept in mind thit at in^st only 7 are significant, 'llie fonivd w.is ro- 
ta ini xl in caiX"* of future ojqvuxsion of the mint.is*a. IVdh the inpvit ami output 
uHitinos ixnild bo U->tter, but sim*o their fvuict ion is only tangi*ntial to the 
inujtvt at h ukI tlu*y va.*rG kept t*n tla luic fiuict ii^nal level. 


Mi trix Operations : 

All n« trices in the system are defined as two dimensional, including 
vectors. Hie first two bytes contain the number of rows and tJic number of 
columns in the particular matrix, respectively. Hiis effectively limits the 
number of observations to 256, Vectors have one of their dimensions identi- 
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cully cijual to 1. The next two bytes contain the address of tJic first byte 
tJiat follcjws Uie last byte belonging to the matrix. Adjacent eli:ments in 
a row of the matrix are stored as adjacent floating point numbers in marory. 
Rows are stored seqiktntially starting from the first row in the fifth byte. 

In an effort to minimize tlic numlor of aildross calculations in the least 
squares algorithm the APL program we vore suiapli*?d with, (LSQ) , was convtarted 
into RlRl'RAN. Hie calculations inwlved in tlie residual cxijuations WL^ro all 
giouixd together inside one big loop. Hxj advantage of such a schere is tliat 
once an offset is calculat*?d it can l-o ustxi to address all the needed ehmonts 
of tlio mitricos involved in the calculation, hhon tJae timi came tlxiugh, to 
inplurmt it using 8080 assembly language, it become all loo apparent that 
tJx?re were too many addresses to keep track of aixl too fi?w registers to lielp. 
Hie re fore, due to tlie limitaticn of addressing ca^xabi lilies, routines were 
inplejiv'ntcd for tlie various nvitj ix o^x-'rators in APL. Hiis resulted in veil 
structuied .and vany efficient cotie, the stylo loing dictated by the instruc- 
tion set. 

A minimum number of matrix utility routines was necessary. Matrices can 
Ua created by six-^ifying tJieir dimensions, they c-an lo filUxl with zeros, they 
can he read from a device, Uvy can be novtxl (copied) in storage. 

Hvro are four clas.'^cs of operat.ions by which matrices nway be altered 
involving the following argumaats. 

• a aaist.int and a matrix 


• a vector and a matrix 


• two matricjcs (plus ix)ssibly a result iiotxix) 

• one matrix (for example, inversion) 


In our particular aiaplication tJx?re was only one inversion of a 2 by 2 
nvatrix involved. A sinple algorithm derived from Euler's method is inple- 
mented using fixed pivots. .Execution time and teirix)rary storage are opti- 
mized. 


llaviiuj dovoloixxi tJio tools tJvit va'io tUrwussixl in provioun }'>oct ions Ukj 
actual ini-JlcnKnlation was stxaicjht forward. Ftor roarxins alroaily nr*ntiomx3 a’ 
ixxilino was written to match tlw KSQ nxitino* dewlc^xxi Ijy Dr. Mirini aliaast 
stalen\'‘nt by statimxit. 'Hxj cx>rrosixTiuV''nro is iixiicattxi in tlx? sounx* pro- 
gram by ko(?ping track of tfx? APL .statement nunlx'rs. 'Dx? array nonr*s vx?re 
k»?pt Ux? s.anv! as rnch as ix)ssiblc ami only one .ullitional tonixarary matrix 
was nxiviircd. 'Hx? program was written for a maxiniin of 100 ohservatiot?s. 

All nvatrix oixuatiixis as Xst"*!! as tlx? s*]uan? iax?t ki op l r.u*k of U>? calls to 
tlx? flcvatincj point loulines. 

■rix? wlxalc pack.ujo niakes limited use of t\vo monitor loulinos, vdiich c.ua 
easily Lo el imiruitixi. llx? rea.vx?n llx'y am llx'm in Kxmuiu? luiflw.u'o was 
Ix'iixj devi'loixxi in lujclaine langiv\go aixi tlx? m>nitor puwidtxi a lot of netxltxi 
help. So, esst'nt ially, T.SQ c.an lx? ram txanidt'lely iniV'ix juk'nt ly . 

'Ihe sixici' n>|uiir'n»’nt.s for this i-\art imilar i\u\ w.is ai^na^xinutely 16k 
bytes, out of which 4k cxxild 1»? in UM .uhI 1?x in K’iM. 'Die exact numlx'rs 
are as fcrllovsis: 


(\»le; 3('*i6 l>^tt'S 

lArt.a; 10 365 iTylcs 

Stack: 100 bytes (aibi t lari ly) 

’A)l.il: 14121 byti'S 

Tneanyorat ixl into tJr? jvackage w'le l\?ur ix'iunt iiuj nnjliiv's that, kept 
ti.ick i?f the numlx'r i?f aillil ions, ‘aibt ract itxis, lailt ipl ieat in^s .\iiu ilivisi(?ns 
I'vluin'd (luring eav-h iti'vatii?n. Hx' ii'sults will Ix' analy.axl in the luwt mx:- 
t ii?n. '11?o actual imi?lem'nlat ion \%x'uld not iixjuire tl>':\. nnitincs. Mhe exnuitiirg 
(?verh(\ul to each aritlinx't ic I'ix'i at i(.m is ai>i'nxxiiuiti'ly (xjual ti? half th* tim? 
of an .uldil ion. 


* .‘-'•x-* Aj iX'ndix C. 


T ntor prctln g tlx? R?.sul ts ; 

IliG final run oonwrgcd aiKi yieldcxi five digits of acxruracy. If conver- 
gence is defined as a ratio of tvo succesive residuals being close bo 1 ’ 
(in absolute) it was attained at the ninth iteration to within 0.00001. 
Cai^xiring these results to tlie run at CSPC (run at double precision, or 16 
digits of accuracy) we notfe the 5 digit accuracy of our result. 

Numerical analysis gives us enough tools to justify the loss of twa signi 
Ficant digits in the course of the iterations. The main source of error ap- 
pears to be the subtraction of the estimated range rates from the actuals. 

'Hie subtr^lction of the average residual txjuations i:ou1d could contribute to 
the error as vxall, 

nie measured execution time for this particular run was 62 seconds per 
ito-Tution. n>e microprocessor used was an 8080A by Intel. Adjusting for 
couiting the number of operations the true time bccores 61 seconds. Ihe 
8080A Ivis a cycle time of 2 microseconds. If this system wore actually 
inplt nentod, the 8030A-1 Q'U caDuld be usixi wiiicli orfers higlior spued with 
cycle tine of 1.3 microscx:onds which could Ining ('xocution time down to 40 
seconds f >r each iteration giving approximately G :.anutr>s to roach conver- 
gence. This figure is derived with no modification of the .software. Since 
it falls within our difinition of “real time", which was around 15 minutes, 
it is definitely a workable solution. 

AnotJr?r alternative is, of courtie, to use haidwate floating point units. 
IVo units that we are familiar with indicate a disparity in execution times 
of several orders of iragnitule. Tlieir six'cifications apjx^r in Aiipondix B 
for tJie pur^xisGS of the following analysis, 'typical* execution tines for 8 
digits of precision of the North Star CoiTputers, Inc. IPB vmit wore inaed. 

Our systim indicated the following froqurncy of floating point operations 
for each iteration: 


Alditions - 3137 



Sub! r.ict ion3 - 672 


Multiplications - 2382 
Divisions - 940 

WiKsn trying to cori^uto the tiine it wjulfl take to o.-ocute those instruc- 
tions wo noticed that the time it tikes to access liardwaro floating point 
unit is more than twice than the time it takes to do the calculations. Runely, 
we e.uTti up with the folloisdng numbers; 


TIME (SEC) 

0.35 

0.625 

1.175 


finrosB 

ix'rronn tive c^x?rations 

ini>ut <ii'>d output tlie nuiiix.T form 
ITB (8080A-1) 

total tinr» rixjuirod 


Thi'ivfore, ase of Ivinkvare luiits mike it i»!-;;ible to docrea;5e U>e execu- 
tion time by one order of nvKjnitvjde. 
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Future Research: 


The paranctcrs that hive to be (^timizod in the search and rescue 
mission consist of the accuracy of the ix>sltion estimation and the time in 
v»iilch it is performexi. Proving the feasibility of a mic’'JiJrocx!s.sor ‘ 
iirplcsncntation is far fxxm devising an cptixival algorithm. 

If the nailinear regression method is utilized there is a lot of 
room for Inprovcnrint in the inital estimate, a quantity that can affect 
tl« wliole outceme of tlio iterations. Several methods tliat are suggested 
in Dr. Marini's paper can be explored. Furthermore, siiK:e the data 
collection tahes an aipreciable amount of time an aluvoriUm shovild be devised 
in which an estimite is ui>jra<l(x 3 with i-ach ina^iuing datum. If Uiat algoritlim 
is good enough then the ostimite could 1x2 Uxa ii siilt itself. 

A further enhancau'nt on tJie calculation tiem can Ik achieved tlirough 
ixirallelism. It can appear on two levels: 

• The implementation of the least stjuaros algoritlin 

• Ihe grouping of data 

'fhe least squares algoritltn iray be broken into parallel subtasks that 
can be perfonuxl by difCermt processors in parallel, esixx:ially floating 
point operations. 

The data may be groupxi in clusters on which tlae least squares 
algoritlim is aiaplic?d. 'Die estimate ptovidid by ».ach cluster is then proces:jod 
through least squares estimation itself. Hiis naathod could be applied .at 
data collection, time too. 


AlJ^mndix A 


• Stimplo 


run at GShXT 
run* at Columbia 











rHE .FESULTIliQ POSITION IS: 
-0. C.*:«24835SlC.E+03 V= -0. 


“ sample run at Columbia 
0. •»3933*r':0l37E+G4 


■pESIDUfiLS « 0. 115‘1<515402E'+01 



- 0. 41V''6S226S2E+04 


THE RESULTING POSIT fON IS: 

X= 0. 6216947555E-»^03 V- -0. 49123i7547E-*-04 2 

' o ' 

•* HS”F:ES I DUALS" a ~10ri050582067E-*-0i 


IE RESLILTING'POSITION IS:“ 

0. S4e45872ilE+03j V- -0. 4839673930E+04 Z-j O. 4O60S8833SE-^04 

5 ^- 9 

IS "RES I DURLS~^'“~0ri3S9477729E+00 




■RESULTING POSITION ISi 

0. 947070312OE+03 V= -O. 4838834490E+O4 7.~ 0 4033941268E+04 


RES I DUALS — O: 070640211 1 E+ GO 


RESI-ILTING“PDS1T10N IS^ 

0. 9.-37SJ 50680E+03 V= -0. 4838407516E+C4 Z- 0. 4023446G83E+04 



RESIDUALS =■ "0. O65O79631SE+00 


RESULTING* POSITION IS:*" 

0. 9999647140E+03** V= -0.M8383789O6E+04 Z- 0. 4G202331S4E+O4 


RESIDUALS = 0. G647644424E+00 


THE RESULTING POSITION IS:* o 

X= 0. 10028T.3393E+O4 V- -0. 4S333798S9E+04 Z-= G 4GJ 949Sft25E+04 

?MS RESIDUALS = 0. 0647490Si96F+G0 






THE PESUl. TING POSITION IS: :jflM 

X= 0 J 0h“'”i2A96'“iE+G4 V— “0. 4878384628E+G4 Z-- G. 4GJ 9?2D2?6E+G4 

" o' “ a 

RMS RESIDUALS -'“0. 06474804376+00 " 




THE RESULTING POSITION'IS : MK*£ ' _ 

X- 0. iG03620147E+G4 V- -G 43?8.579r:S9E+04 Z~ G. 401970046GE+04 


RMS RESIDUALS = 0. 064747S67S.F+G0 


2 


THE' RESULTING POSITION ISl ^ 

0. 10G26S2136E+G4 V- -G. 4:=;3.37S4623E+G4 Z- G. 401928329'iE+04 


RMS'RESIDUAI.S - 0. 0647482372E+GG 


of the 

ORIGINAL Pag^ js pqqr 


THE RESUI ting POSITION IS:" 

X- 0. J 0G3667331E+G4 V- -0. 48"8":7-^:-;'".?E4ri4 Z-- G. •101929 h924F+G4 


RMS RESir-UAl.S == G. 064743344-IF+00 


Appcjxli^ B 


'I\^o tyi‘)ical horclw.'.re flMtimj ix)int luuts 

* tTB by North Star Cciipulors, Inc. 

• KPU by Cyli‘*nir't ic Micro Systons 


mi HAi A sm I r 


I XI CU IION TIMliS i.2. 3 


PRrCISION DIGITS: 

2 

4 

6 

8 

10 

12 

14 

ADD best 

1 

1 

1 

1 

1 

1 

1 

typical 

8‘ 

8 

9 

9 

10 

10 

11 

worst 

10 

10 

10 

11 

11 

12 

12 

SUUTIiACTiH’st 

4 

4 

4 

4 

4 

4 

4 

typical 

8 

8 

9 

9 

10 

10 

11 

worst 

15 

16 

17 

18 

19 

20 

21 _ 

MUl 1 ll'LY I'l'st 

b 

5 

5 

5 

5 

5 

5 

typical 

18 

34 

!.5 

HO 

111 

146 

186 

worst 

51 

1 25 

228 

382 

527 

720 

933_ 

DIVIDE best 

7 

7 

7 

7 

7 

7 

7 

typical 

39 

/O 

109 

156 

211 

274 

370 

worst 

52 

139 

229 

310 

4/0 

621 

7/9 


1 Timi’S given in jniciosecomis 

7 r \t'i ution times .hc i luiu tion of the m|uit v.diies 

H Times listed do not incliiile ti .losnvssion of input vjIoi's .m.f lesolt 


I'e »i(f dimensions; 

Moifel A: 5 in. hy 10 in. 
Model fl: G'i m. by 1? in. 


I'ouei lemiiiemrnts: 

Model A; d V (nniejol.ilevl) vs? 1.7 A 
Model H; 5 V (lejolated) k'i 1.7 A 

Hoard Consti nction: 

I M4 maten.tl, gidd plaled eiUje eonneetois 

I lo.ilmij vv)int (lomlvi i epi I’S' iii.iliim: 

Ryte 1 ; bit 7 sign (1 neij.itive nombei, 0 pi'silive nombei ) 

bits U 0 I'Kponent in exeess t’l binaiy lepiryenialion 
bits 7 0 /e.o lepiesenis ibe /ein value* 

Ryte 2; bits 3 0 least signifieant daiit of value in R(M1 ei'ding 

bits 7 4 next least significant d'ljit of value 

Ryte n: bits 7 4 most signifiiant dujil I’l value m lU'll coding 

bits 3 0 next most siiimficant digit of value 


I 

I 

I 

» 

i 

t 

t 

! 

I 

j 

\ 



*Snmplo nwi ol the Nnith St.ii I I’M f«)r .i diviMo opi'i.ition with 8 diyit ptocision 
*Iu this «'x.iinplc .njiimt'nts ,iii* in lui'imuy in foim: 

* Most siijni(i»:.»nt hyl*? Irish) ili.jit p.iir 

* Siisoipiont digit p.iirs follinv thr msli 

* rAponent/sitjn hyto follows Ish digit pair. 

* Pointer addiusses tin? «*xpi>nrnl/siijn hyto 
•HC has loft aig pointer 

•DE has right atg pointer 
•HL has losiilt pointer 

* 1 ho I PB leecives its aigiiments hy ‘‘peeking" at the MOHO Inis 
•when the argiimont valines aic loaded to accuimilator. 

•Two jum|ieiahlo "haidwiied" a.Idiesses aio leipiired for signaling the KPB 

•This lotitiiur may he ijt'neiali/e<l to peifoiin any opei.ition, at any precision. 


I mV EDA MSIMT 

MVI A.a'IGrDIVOP 
I.OAX D 

ncx O 
I l^AX D 
OCX D 
l.DAX D 
OCX D 
I DAX D 
OCX D 
LDAX D 
l.DAX B 
OCX B 
l.DAX Q 
DCX B 
I DAX B 
ncx B 
LDAX B 
PCX B 
I PAX B 

• Now the floating Point 
LXI n.l I’DIN 
I mvi I DAX D 
OMA A 
.IP fDIVl 
AN I EMUS 
I DAX D 
MOV M,A 

ncx H 

I PAX D 
MOV M.A 
PCX H 
I PAX D 
MOV M,A 


I his "h.iiilwiied" lefeieneo signals fPB to "wake up' 

.•spei ify pieeision .iinl opei.ition code to H’B 

f \p»Mient/sign hyte i>f light .iig 

Adv.mee piWntei to next hyte 

I e.ist sigmfie.iiU digit p.iii of light .irg 

Advance pointi'r to next hyto 


Most significant digit p.iir of riglit arg 
fxponent/sign hyte id left aig 

I e.ist siijmfie.int digit p.iir of left arg 


Most sigmfii aot ifigit p.iir of left .irg 
Mo. lid is performing the opei.ition 
"M.iidwiied" .iddress for receiving value from I I’B 
l oop w.iiting for eomnletion signal (sign hit) 

1 ho I I’M is ifone whei the sigrt hit heroines "1” 

I onp if sign hit is still "0" 

Chick for eiiot, rondition testeif .it end 
I' xpi'iii'ot/sign of result 
Stoie exponent/sign of tesoll 
Adv.mee pointer. 

I east signilic.int digit p.iir of result 


PCX H 
I PAX D 
MOV M,A 
PCX M 
I PAX D 
MOV M.A 
UZ 

jMP f MMOM 


insh hyte of n'siilt 
Stoie It 

Isctiiin d no erioi w.is ileteeted 

('o lepoit eiiof (i e. imdeillow oi divide hy 0) 


FLOAT ING POINT UNIT 


PRICE LIST 
MODEL 

1 

• 


QUANTITY 

25 

100 

n 

$595.00 


$535.00 

$475.00 

n 

470.00 


425.00 

3/5.00 

#3 

345.00 


315.00 

275.00 


All sales FOB Palo Alto 


EXL CUTION TIMES 
FUNCTION 


TIME IN MILISECONDS (approximate) 


ADO, SUB 

no 

MUL, DIV, SQRT 

225 

TAN 

846 

LN, SIN, COS, -►POL 

1250 

POWER 

1720 


CV3EPA'ET1C /DICRQ 

2460 iZMBARCADERO VMY 
PALO ALTO, CA 94303 
(415) 321-0410 


Api'xjndi^ C 


Tlx? Al’L least squares prxxjriim 


t i ] 5 /first Qjfss FOR iiiiiFR ;•i'»01n.:lH II' • r'l'irsiriM onopriiMorFs 

^ .11 IS MUMBER OF 1 1 C> I'l 1) Ol I' • : fl • I I 1 1 1 I i 1 1 1 1 |i • 1 rRT«R . :-j 
. \ -FL.^;^-FL 


f Ji-.-’lOCE E FROM POl.ES I'll Mi in' hi. I |i:t'i;]|l in 'M.IRr HRs EflRFH ROB I US 
i J H £• 1 1 J M £■“?:= :OEx (R Cl J "0 1 f I . ' ^ n 

i‘ -^riE-r ((ECU A£.i (E CE.] .* ?.i <- iR f 1 - c. • MO . 

r ]• fi u BY 3 iiriTRi:-: Of •■|iii;i I j iTJ ro'.jnriis; u» of uflcicities 
- ' '•S.i R£ 

= v*3j/:0.5 o a 

' IS OCCTOR OF F:ONr.E RillFS MFMMF.EH SniFI L 1 TE OHfi POSITION £• 

‘>.- ExL'.i -^Ra " * 

i'l IS N coMPOMEur oEcroi: or iii:nsiiF:Fn nnii.E rfnes 

■ RBri“RB !■** r ri 

i 5* RES-B I RS<- ( PES;i ->r'RC:S 
« • I -H n 1 :• /END 

riPi.iLMTE iiorpix OF FT.SJilUiiL. ri'Uinioiis 

• O O-pEj ERB-HpAE ■■■• 

■ - • *i M r-Tiou<- (+ • [ 1 1 I i:i r i 1 ,-i 1 

■ .'ll .. I iL.N r£ spuER I L fiL-coR 1 r;s 1 rm i F:rii isf'i. t : i ii 1 1 1 oi i : 

; • : p p (~B c 1 J xE cs J ) .« ( “p CP 1 ;i ? ( -p cs j xp c s 1 1 ? t: c 1 1 » cio i;p c i j aej f p cpj a 


n.'.r soijoRES soLurioN or RESinuoL roUiinoNs 


■ ^ .XRESQM 


•R.iS RESIDUllLS: '* l f+/RFS*:EHPPFSJ aO. S. 
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Microprocx!ssor Utilization in Search i Rescue Missiems 

nUAL W-:iK)RT 

*_ - ;; 3H t 

- * - V< 1 1 

in 

; The position of an ennergcjncy transmitter imy be determined by measuring 

D . 

the Doppler shift of the distress signal as received by an orbiting satellite. 

o 

q 

This requires the oorputation of an initial estimate and refinement of this 

estimate tJuaDugh an iterative, nonlinear, least-squares estimation. 

a 

^ A version of the above algoritlim was Inplcmented at Goddard Space Flight 

B a 

Center (GSPC) and tested by locating a transmitter on tlao premises and ctotaining 

hi 

observations from a satellite. Hie oonputer used was an II3M 360/95. The po- 
sition was determined within the desired 10 km radius accuracy. 

Hie purpose of this project is to determine the feasibility of performing 

the 55ane task in real time using microprcxx 2 ssor teclinology. The least squcuo 

b 

algorithm was inplimentod on cin Intel 8080 microprocessor and the same expori- 

□ 

ment was run as at GSIXT. 

Hie results indicate that a microprocessor con easily match the IBM im- 



Intrcjductionj 



pionontation in accuracy and be performed inside the time limitations set 




Microi')rocx!ssors : 

TJjns Is an inplicit restriction in any; search and rescue mission. The 
use of satellites and oenputers is dictated by that time limit. Ihe use of 
big ocrpjiter to detennine the position presupposes ccrnnunication between the 
satellite and the ooitputer^ 'Ibis oemnunication introduces a time delay since 
the :5atellite is not always within r.adio visibility of an installation thit 
passesses both tlie oormiinication and corputing power for this problem. Pur- 
tliomore the result has to be forwarded to a ooimnnd center to do the dispatching 
Micrtprocessor utilization can alleviate this situaticai in two ways: 
by giving cheap oorputing power to oomunication facilities or by incorporating 
the oorputing power in the satellite itself thus cluninating this oonmunication 
conplotely. 



Microprocessors offer light vx?ight, snail volume, low power processing. 
Their speed is improving rapidly and their cost is going down. They are the 
logical cJxjice for a satellite search and rescue system if they can perform. 






Confi<j \i Ml ion: 

Strictly i^ivoking thprc arc Uuog micn5pirKc:?:y~>r (.cnfigurai ions in 
lliis project wliidi we are going to discuss individiol ly. 

• I\!wloiT.»:'nt sysli jn 

• Mininvrl oxi-Hailion system 

• /W'lual field (onfigxiral ion 

Initially our develoiTr:mt sy.slfTn consislf'fl (jf iin Mns-80 Intel lee micro- 
('onputer Ijy Intel with 16k bytes of 1<AM rviToiy .uid a resicV'nt lOM nonicor. 

M)st of tlie flo.it ing i»int puc'V. ige v.\js dovi'U'L^d in rod line loivjuoge c'ln that 
system using tic monitor's limit ixi hi'x.idix:imil «\iitor and ddiugger. ilc need 
f('>r note lophistic.Jtion beceme a>>ii\rcct. After S'-'vxual failuves in if>iiloi;ing 
altemilives (as fam.-y as Inoking up to a fDP 11 thro-agh a tiliplyine line 
for lure stoiage) wo wi'te able to aajuire a d»v\l floppy diak drive by Intel. 

A .‘^pire line* piioler wui at larlvil to Uvi system with mirvu- hinl..'re ni’xlifiea- 
t i(;ns and 16k byU’S muc i;.\M w. ie add'd in ord<*r to eupp-'it. 'Hio oiilianced 

syr.tem Ivid tdo p?wcr of a mini-exm^niler in :oriwaro (ae-i-uiibli *r, editor, li.- 
biary :a»nager, linkage ixlitor, liador, ai'd a saffiei-’nt file ii.'Uig.'ij at a 
sp ni wiiich was slow !)ut .ic»;epLable. 'Hie fl( Kiting joint iviekage was converted 
to .is;-, rhly language, and t'wo iiore piokages wi*re di'velojx'd; Uvi I/O jviekKje 
ai>i tl,t> ;:atrix n.mipulalion jviekago. Una':;* -ct i d iK'lp e uie fiom tio ir;e of 
lii:-80 (In Circuit Ifuilalor) , d. -signed for a different application, as a 
l*v«irul jymlolic debugger substituting for ti>.’ mxiitiu* b.o>;ad<xn]ml debugg-er. 

Oal of tins final v«^rsion of the ilevelcpi nt systim only a limit id 
.u.ount: of ji“'om.x^s wi'io ir-uxi for the final run. ’ilni-o ddiiv? tl>i minimil 
exi-cution syfitem. 'llie disk was only uswd for injnit of '.lata. Tlie es^;ential 
ji.iitij were; 

• 'lie Q’U emrd 


• 1( k 's of n» •;:! >Ly 




Itc (ictual field configuration would be the some if the nv^chlne were 
located on the gnxnd. Sonc kind of oomrun lent ions cquiiwnt would be re- 
quired to provide the data input aixJ, mybe, start the run autonvitically. 

The configuration would be different, though, if the machine were located on 
the satellite. The requirenents for the satellite configuration would be: 

• The CPU card 

• 16k bytes of m^jiory 


Ail interface that can load the infomotion in memory 
A means to oomTui^ic.ate Ukj result to tj.e world 
Pow'r suivly: 12V, 5V, -5V, ground 





nx? Fl oatin g Point PjcJcage : 

Based cwi estinutes of the number of operations required vge were in- 
clined to think that any floating point operations would have to be performed 
by Ivirdware and not by software since estiitubed tines become prohibitiW. 

^^us floating point package was developed to help us count the actual number 
of operations rather than perform *-hnin in an actual situation. The final run 
pravod our estinntes wrong and the package gained new inportance. 

TTiere are a number of representations of floating poing nuniaers dif- 
fering in accuracy end range as a trade off to the luinher of bytes required 
l^er number. Tlie one ased was the ANSI format for lOKTRAN which happens to 
be inplemented by hardware as an option in IBM carputers. It consists of one 
sign bit, a seven bit oponont (excess 64), and a 24 bit nuntissa of hcxa- 
d-'ciiml digits. The accuracy is 6 hoxadecinul digits or approximately 7.2 
dt?cinal digits. Specific operations wer'^ not timed although a more general 
timing analysis appears in a later section. This format was chosen as opposed 
to tJie BCD ionrat because tlie siacf cquirenents are Icxver for the same 
amount of precision, wiiich in turn reduces execution time slightly. A man- 
tissa of binary digits was not used because of the frequent need for nonrnli- 
7.ation. 

Addition and subtractim take exactly the sane tine, whereas multipli- 
cation is approxinotely equal to 22 addition and division is approximately 
60 additions. 

Multiplication produces a 48 bit result mantissa whi t is then nonnilizcd 
and rounded to 24 bits. This preserved the number of slgnifir.mt digits, or, 
vii^vod from a different angle, is the same as a double precision mi .tipiy if 
the two argum:!nts vsore expanded with zero fill. 

Division preserves tJxj significant digits again by c:qMi''ding Uio n/in- 
tissa of tJie dividend to double precision and results in full single proci- 
e.ion result. Nriinial i'/at ion and rouniling occur as in irrolliplication. 







Acc'ur.icy is thus pivixu-wx! U5 ti^io simjlo pivoision thrnvxjlmut in a 
niHunically stable miiiner kcopimj the loivjth of Uio nurrbor tx> 4 byb>s. T1>e 
exist is expensive multiplication and, oxixxrialli*, division. This dictates 
a pixxjiamniixj style vdx'reby division is avoided vuiless it is absolutely neccs- 
i>ary. Tlie benefits, on the oUier liand, are nunerically stalile inplonenta- 
tions wi»se results irvitch the double precision to Ux> extent possible as 
will be Sixai when Ux3 results of the run ate .inalyzod. 

The square rxx>t function was inplenentod by using a variation of Heron's 
formula based on tJie observation tliat the mantis.sa of vxny flo.it ing 1 x 5 int num- 
ber will Ivive a value of 1/16 to 1 (intorxnebxi as a fraction) . As a first 
guess an approxinvatioi to a straight line connecting the two end points is 
mule. Fxperiirrai tally, six itr'iations w*re found necessary to produce an 
accurate result. A Ijetter first guess could improve tint significantly, but 
time constraints did ivat allow us to piursue Uiat direction. 

Fiiwlly, input and output of floating ixaint numlers turn out a mich 
nnre serious task tlvan first exiiected. The input routine ixxxvjnizcd ntimix^rs 
with a maximum of ten integer and ten fraction digits. Tliis ptx>V3d note 
th.an sufficient for omr nctxls. Tire output routine prcxliices a rigid scienti- 
fic format with 10 fraction digits. Wicn interpreting tJie n^sults it slx^uld 
le kept in mind that at nost only 7 are significant. The fornet was rc- 
tainol in case of future ex^xansion of tire mantissa. Both the input and output 
ixaut ines could be better, but since their functior is only tangential to the 
project at liand tliey wie kept on tire bare functional level. 


Mat rix Operations ! 

All natxic3es in the system are defined as two dimensional, including 
vectors. The first two bytes contain the number of rows and the number of 
columns in the particular matrix, respectively. This effectively limits the 
number of observatiens to 256. Vectors Ivave one of their dimensions identi- 
cally equal to 1. The next two bytes contain the address of tlv 2 first byte 
that follows the last byte belonging to the matrix. Adjacent elements in 
a row of the matrix cure stored as adjacent floating point numbers in memory. 
Rjws are stored sequentially starting from the first row in the fifth byte. 

In tin effort to minimi 2 e the number of address calculations in the least 
squares algorithm the APL program we were sr^lied with, (LSQ) , was converted 
into FORTRAN. Ihe calculations involved in the residual equations W 2 re all 
grouped together inside one big loop. The advantage of such a scheme is that 
OTce an offset is calculated it can be used to address all the needed elements 
of the mtiurices involved in tJie calculation. Wien the time came though, to 
ijtplc3T»?nt it using 8080 assembly language, it became all too apparent that 
there were too mtmy addresses to keep track of and too few registers to help. 
Therefore, due to the limitation of addressing capabilities, routines were 
implemented for the various matrix operators in APL. This resulted in wall 
stnjcturod and very efficient code, the style being dictated by the instruc- 
tion set. 

A minimum number of matrix utility routines was necessary. Matrices can 
be created by specifying their dimensions, they can be filled with zeros, they 
can bo read from a device, they can be moved (copied) in storage. 

There are four classes of operations by which matrices nuy be altered 
involving the following arguments. 

• a oDnstant and a matrix 


• a vector and a matrix 


• two iratxices (plus ix)ssibly a result nvitrix) 

• one nutrix (for cxanple, inversion) 

In our particular application tlx^re was only one inversion of a 2 by 2 
imtrix involved. A sbiple al^rithm derived from Euler's noUrod is imple- 
mented using fixed piwts.. Executim time and temporcary storage are opti- 
mi/rd. 


T n^^l cnr-'ntj ng tlx* Exporim o-nt; 

Having developed tho tools tliat wre disca^sed in previous sections the 
actual inplcmentation was straight forward. Fbr reasons already mentioned a 
rautine was written to match the I£Q routine* developed by Dr. Marini '^nost 
statement ty statement, nie correspondence is indicated in the source pro- 
gram by keeping track of tlie APL statejnent ninTbers. Tlie array names were 
kept tlie same as mach as possible and only one additional tcnix)rary natrix 
was required. Hie program was written for a maximum of 100 obfjorvations. 

All iTQtrix oi>?rations as \Nell as tho sejuare rot:>t kec^ track of the calls to 
Uie f loci ting point routines. 

The v^tiole ixickago makes limited use of t^^o monitor routines, which can 
easily be elhiiinatod. 'fix? reai^on tlvcjy are tliere is lxx:auso software was 
being develc^xxl in nuchine language and the manitor provided a lot of needed 
help. So, essentially, IjSQ can be run caiplctely independently. 

Hie s[xace lax^uinmiints for Uiis particular run wus aiproximtely 16k 
bytes, out of wliich 4k cuuld bo in lt)M aiid 12k in lv\M. Tlic exact nunhers 
are as follo\'«; 


Code: 

3656 bytes 

Data; 

10365 bytes 

Stack: 

100 bytes (arbitrarily) 

'hotal: 

14121 bytes 


Inoorxxarated into the package were four counting routines that kept 
track of the nuniber of additions, subtractions, multiplications and divisions 
rcv]\iirod durhig each iteration. Tlie results will be cijialyi:cd in the next sec- 
' ion. The actual hiplojientation VA.iuld not rctjuire tlicse noutines. Hie couiiting 
overhead to each *iri timet ic oieration is approximately equal to half the line 
of on addition. 


* S<e Ai^i>.'ndix C. 


Tnt.o r proting the Results ; 

Hie fined run exanverged and yielded five digits of accuracy. If conver- 
gence is defined as a ratio of two succesive RMS residuals being close to 1 • 
(in absolute) it was attained at the ninth iteration to within 0.00001. 
Caqxiring these results to the run at GSFX: (run at double precision, or 16 
digits of accuracy) we note the 5 digit accuracy of our result. 

Numerical analysis gives us enough tools to justify the loss of two signi 
ficant digits in the course of the iterations, llie main source of error ap- 
pears to be the subtraction of the estimated range rates from the actuals. 

'Hie subtraction of the average residinl equations could could contribute to 
the error as well. 

The measured execution tune for this i»rticular run was 62 seconds per 
iteration. Tlie microprocessor used was an 8080A by Intel. Adjusting for 
counting tlie number of operations the true time becomes 61 seconds. Ihe 
8080A Q^U has a cycle tiiie of 2 microseconds. If tliis system were actually 
inplementcxl, tJie 8030A-1 CPU could be used which offers higlier s^x^ed with 
cycle time of 1.3 mic.oseconds wliich could bring execution time down to 40 
seconds for each iteration giving approximately 6 minutes to roach conver- 
gence. This figure is derived with no modification of the software. Since 
it falls within our difinition of "real time", wliich was around 15 minutes, 
it is definitely a workable solution. 

Another alternative is, of course, to use hardware floating point units. 
Tao units that we are familiar with indicate a dis^xurity in execution times 
of several orders of nmgnitude. Their specifications apj^joar in Appendix B 
for tJie purposes of the following analysis, 'topical' execution times for 8 
digits of precision of the North Star Qaiputers, Inc. FTB unit were used. 

Our system indicated the following frequency of floating ixiint operations 
for each iteration; 


AJditions - 3137 



Subtractions - 672 


Multiplioitlons - 2382 
Divisions - 940 

i'Jhen trying to cotrputG the tuna it would take to execute those instruc- 
tions vgo noticed that Uie tiina it takes to access hardware floating point 
unit is iTore than twice than the tine it takes to do the calculations. Namely, 
wo came up with the following nunix?rs: 

TIME (SBC) PURPOS E 

0.35 ^x?rform tJie operations 

0.825 input and output the nurrJxjr form 

(8080A-1) 

1.175 total time required 

Therefore, use of hardware units irake it possible to decrease the execu- 

tion time ty one order of magnitude. 


Future Research: 


The poraiieters that have to be optimized in the search and rescue 
mission consist of the accuracy of the position estijiation and tl^ time in 
wliioh it is performed. Proving tlie feasibility of a micrcprocessor ‘ 
ijiplcmantation is far fran devising an cptinal algorithm. 

If the nonlinear regression method is utilized there is a lot of 
room for iirprovement in the inital estiirate, a quantity that can affect 
the wliole outceme of the iterations. Several methods that are suggested 
in Dr. Marini's paper can be explored. Furthermore, since Uie data 
collection takes an appreciable amount of time an algorithm stould be devised 
in which ^m cstinute is upgraded with each incoming datum. If that algorithm 
is good enough then the estinvate could be tlx* result itself. 

A further enhancement on the calculation tiem can te achieved through 
parallelisn. It can appear on two levels: 

• The implementation of the least squares algorithm 

• The grouping of data 

The least s.quares algorithm nay be broken into parallel subtasks that 
can bo perfomed Ijy different processors in piraliel, especially floating 
point Generations. 

Ihe data nvay be grouped in clusters on which the least squares 
algorithm is applied. The estinvate provided by each cluster is tlien processed 
tlarough least squares estinvation itself. 'Ihis method could be applied at 
data collection, time too. 


7\ppond ix A 


• Sairplc run at GSPC 

• Sanple run at Columbia 




•.'JJ ■ • 


! f: 

’ I I'lL.O* 1 • 

.M- Hi:,r ' 

y* ---s _ 

* _*i 1 .** 4 *d -* • .*'•■**.*'*•*- 1 . ' 1 I I » ' • I • 1 * 1 


j :. ;:Q £;M 

' i '■\i|'iLS: u. l:>r. Ti'O 

I? Hlj r l^l*!b. • *'■ U • I i". i' • • I 1 1. ! ' I . 

*'.'1 IS I , 

■ - j 1 A "^SSCi . IT10 J 1 • ' 1 i' 






:.ss 4 




fC' ti:H 


! .•i'ils: m. -jt- r 4 sr; 


NA.SA nnnplG run at CSb\3 




• £‘H 

’ r : Tils : 0 • u r>jS 4 i'i 

I |l.i r MKlr. • i' c* 1 *;L 4 i" J r!i 

.* " r* 4y -''r» • c *1' 1 i' *r* !■ , • i J t,*i" i .Vi* 

I ] -i '5 

■•JSl 




' l-’f.i"! £N 

"LTILS s U • TSSO r i 

« Hi:ir MRS: Si'. - ; Ti: .'i • J.hTJ. <.r:n- 


M ro 1 SF 


■ .I.'ILS: Jt- 

• HLjF I-IRR: Jf-'" -" J . "■•1 . /iR • i.ij Jj 


' L - T ) SR 


1 * 1 ji J r 111 " I". • ■. j» I.. I't '.i' f I ij 1 ••■•I ij 


* 4 * 1 ' JS • *-iLir*L^* >" • 


■ i::i ^ ^ 

■ Til.'!:: r-. ijS-T-*.: .X S:-:*‘. 

• iir.r i-iRR; ::*j . i ’! ;l’;xr ,■ si . ;’is;V'!.t’ s 


jrS "4:::So. .m . | j. ';*...i:i 1 


■' CH 

.. ’ll.*-.: I J. .;::j S}'* 

• IIlT hRI' : ■"■'•Jl •*:: ■' M': : J . ,'1 .“'Xi.l''. 


REPRODUCIBILITY OF Hi. 
ORIGINAL PAGE IS POUR 


jijit. • i : :*i i 




'.'• i»l. I'ti.lfiL.';-! 0.r*i;.~. ,*. „■ lj.fO ! J-' 

i’ • ! '• iicr riRiii: .■'. r* 1 1".''.‘ ’"I'l": , •< . , i r* "m ■. i 

‘ I ! ’ ! 1 :;■ : 

'* *• ' I* 1* . *cM c* ."■ I 1 J • ■ 1 ; ,*t 1 


■•■' I-Jl 

.' 'i. .'r : 0 . Ot--' r-j 

".I mR£: ::•?•. ::j :.0::.n;:il 1 . .’1 

• • 

i ' • 

• .* 'Jtiii'y-l i' • " ' J • i’’ '-i' i 


r* • • k^L**‘r I ‘4'!**.^ „'oL* 

1**11 Ml’*tS *1 - ' 1 . 1’*', .‘i 

y - • 

• • • 


UH^KUDUCQilLaT OF 
UHIGINAL PAGE IS PO- )R 




I HK. I'Kii-'L 1 1 NO lOi^rr lori li:; 

C.'jl-ISiiT^SiriE+O? V= -0. -4 


!■•: '0O.^'8T‘r6iE+ 0‘l 


.svuiiJlo run at Ojluini)ia 

0. 1 3 


PllS "RKSIC'UftLS'a" 0? ll5‘461G‘VO?E+01 ■ 


THK’PESULTING POSITIOrr iSr' 

■•:= 0 *?2:.lt.947555E+03 V- -0. 4912r5i754rE+04 2-- O. 41 7»:"8.?2»r"T'2E+04 

*:MS PESINJfiLS ="~oriGT.O5S20*;7E+0i 


rHE’F:Fi::uLTir4Q POSITION IS:’ ' 

0. S4e45872.ME+03 V- -0. 4039o739?SE+04 Z- O. 4O^‘O^8883*:’.E+04 

« 

?NS"PE5IDUflLS '=* ' 0: 1389477729E+00 


THE PESULTINO POSIT I uN IS :' ‘ " 

0. 9470705 J.27.E+03 V= -0. 4:55‘‘'S.24490E+04 7 - 0 4O3“9412t:0E+04 

vl-lS'PESir-UfiLS 0. 0706402.1 llE+00 

t 


THE RESUI.TINQ POSITION ISr 

X- 0. 9:r:7.T0 .00t.:30E+03 V- -0. 4S35M07O16E+O4 Z •• O 4025446085F+04 

I 

TMS PESIPURLS - ~ 0. O^SOZ^c-SlSF + OO . “ ‘ 


fHE PESUL TING POSITION IS: 

0. 999964714OF+05 V- -O. 487 85 7.="'>'06F+04 Z- 0 402O255 .1 '54E>04 
H'IS RFSIC'UHIS - 0. nS47644424F-‘M0 


THE PhTUl.TTNG POSITION IS: 

y,— 0 i002c'T'3 59ZF+04 'r*— “0 485 5^ ^7'^'5Ti9E'*'04 Z-- O 4u 1 94 ^882SF + 04 

-^KiS PFSIPI.IRLS = 0 064 74900967-^00 


THE PESm TING POSITION IS: 

/!— 0 ,1 * 'K*_ *T*“ "li *4:-i ■ -“i ^ 

PMS PFS ] r-URl S - 0 06474804877 <-60 


6 + 1-..1 


•3 40 J ‘^5 .* '.''6E-*-04 


THE RESULTING POSITION IS; 

0 1 0036.20147F-*-O4 V-- -‘■'t 48"R’ 7^=:"'9P^-04 .7- 0 40 19“ 004 6r,r +04 

ri-lS PFSIC'URI. S = 0. O64747:-:67‘‘.K-*-00 


THE PFSUI TING POSITION IS; 

y,~ 0 1003 682J ”6p ^ 04 V - ~*"V 48“>:“ 8467'8E-* 04 .7 - 0 40J 9.?W5'.Z’ ■<4E+04 


FiJS PKSim.lRI.S 


O 064 74 8287. ■•F > 00 


ITY OP T«B 

^ IuEiNaI, 1/7:0 VA l’fV)R 


THE PES"I TING POSITION IS- 

X- 0 I 06“ .-6783 IF ♦-O-l V- -O .l-;".': 7' '’.4* “’4 0 40 I X 64 

PM:S FT-iruril S = 0 O.;.»7.»:E::.l.t.ir'*0*'( 


Appendix B 


TVo typical hardware floating point units 

* FPB by North Star OarpuLcrs, l ie, 

• tTU by Cyberuetic Micro Systems 


n’H DATA SHHiT 


EXECUIION TIMES 1.2.3 


PRECISION DIGITS: 

2 

4 

6 

8 

10 

12 

14 

ADD best 

1 

1 

1 

1 

1 

1 

1 

typical 

*8 

8 

9 

9 

10 

10 

11 

worst 

10 

10 

10 

11 

11 

12 

12 

SUBTRACT best 

4 

4 

4 

4 

4 

4 

4 

typical 

8 

8 

9 

9 

10 

10 

11 

worst 

15 

16 

17 

18 

19 

20 

21 

MULTIPLY best 

5 

5 

5 

5 

5 

5 

5 

typical 

18 

34 

55 

PO 

111 

146 

186 

worst 

51 

125 

228 

382 

527 

720 

933 

DIVIDE best 

7 

7 

7 

7 

7 

7 

7 

typical 

39 

70 

109 

IbG 

211 

274 

370 

worst 

62 

1.39 

229 

310 

470 

621 

779 


1. Tinu-s uivrn in microicconiJs 

2. execution times aic a function of tf^e input values 

3. TiniiS list'll ilo tiot incluilo ti .msrnisiinn of input v.i'uev .in.f icsult 


Boaio dimensions; 

r.'oilcl A: 5 in. l)y 10 in. 

Model B: C^4 m. by 1? in. 

r.nver lenuiitments: 

Model A; 8 V (unfcguljted) 0® 1.7 A 
Muilel 8: 5 V (icjulated) (?* 1.7 A 

Podiil Construction: 

I Itil mdterial, gold pie. id edge lonnertois 

ri.>jtmg poi It nuinhoi lepiesentation: 

Pytel: bit7‘Sign(1 negative numiter. 0 pi'sitive numlief ) 

bits 6 0* exponent in excess 64 bin.iiy lepu'Si'nlatiun 
l)its 7 0 zero icpieier.ts the /eio value 
Cyte 2; bits 3 0 * least significant digit of v.iluc in BCD ceding 
t.its 7-4 - next least significant digit of value 

Byte n; bits 7 4 « most significant digit of value in BCD coding 
bits 3 0'* next must significant rfigit of value 

All values .lie nomali;ed. 

Oibci lepiesenlations ol BCD floating point niimlnns lequiic a change in miciocode and aio available on 
special Older. 


•Sample use of the North Star fPB for a divide operation with 8 digit precision 
•In this example assume arguments arc in memory in form: 

• Most significant byte (msb) digit pair 

• Susequent digit pairs follow the msb 

• Exponent/sign byte follows Isb digit pair. 

• Pointer addresses the exponent/sign byte 
*BC has left arg pointer 

•DE has right arg pointer 
•HL has result pointer 

•The FPB receives its arguments by "peeking" at the 8080 bus 
•when the argument values are loaded to accumirlator. 

•Two jumperable "hardwired" addresses are reqitired for signaling the FPB 

•This routine may be generalised to perform any operation, at any precision. 


FDIV LDA RSTRT 

MVI A.8*16+DIVOP 

LDAX D 

DCX D 

LDAX D 

DCX D 

LDAX D 

DCX D 

LDAX D 

DCX D 

LDAX D 

LDAX B 

DCX B 

LDAX B 

DCX B 

LDAX B 

DCX B 

LDAX B 

DCX B 

LDAX B 

* Now the Floating Point 
LXI D.FPDIN 

FDIV1 LDAX D 
ORA A 
JP FDIVl 
ANI EBITS 
LDAX D 
MOV M,A 
DCX H 
LDAX D 
MOV M,A 
DCX H 
LDAX D 
MOV M,A 
DCX H 
LDAX D 
MOV M,A 
DCX H 
LDAX D 
MOV M.A 
RZ 

JMP ERROR 


Specify precision and operation code to FPB 
Exponent/sign byte of right arg 
Advance pointer to next byte 
I east significant digit pair of right arg 
Advance pointer to next byte 


Most significant digit pair of right arg 
Exponent/sign byte of left arg 

Least significant digit pair of left arg 


Most significant digit pair of left arg 
Board is performing the operation 
"Hardwired" address for receiving value from FPB 
Loop waiting for completion signal (sign bit) 

The FPB is done when the sign bit becomes "1" 
Loop if sign bit is still "0" 

Check for error, condition tested at end 
Exponent/sign of result 
Store exponent/sign of result 
Advance pointer. 

Least significant' digit pair of result 


msb byte of result 
Store it 

Return if no error was detected 

Go report error (i.e. underflow or divide by 0) 


FLOATI NG POINT UNIT 


PRICE LIST 
MODEL 

1 

QUANTITY 

25 

100 

n 

$595.00 

$535.00 

$475.00 

ftZ 

470.00 

425.00 

375.00 


345.00 

315.00 

275.00 


All sales FOB Pale Alto 


EXE CUTION TIMES 
FUNCTION 


TIME IN MILISECONDS (approximate) 


ADD, SUB 

110 

MUL, DIV. SQRT 

225 

TAN 

846 

LN, SIN. COS.-^POL 

1250 

POWER 

1720 


rvSE.^AIETlC ZDICPQ 5V5TF/3J5 

2460 EMBARCADERO WAY 
PALO ALTO, CA 94303 

(415) 321-0410 


Al^xsndbc C 


Th3 APL least squares program 
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